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Abstract. In 1991, Persi Diaconis and Daniel Stroock obtained two canonical path bounds on the second 
largest eigenvalue for simple random walk on a connected graph, the Poincare and Cheeger bounds, and they 
raised the question as to whether the Poincare bound is always superior. In this paper, we present some 
background on these issues, provide an example where Cheeger beats Poincare, establish some sufficient 
conditions on the canonical paths for the Poincare bound to triumph, and show that there is always a choice 
of paths for which this happens. 



Let G = G(X, E) be any simple, connected, undirected, and unweighted graph with edge set E and finite 
vertex set X. For each (x, y) £ X x X with x ^ y, choose a unique oriented path j XiV from x to y. Let T 
denote this collection of canonical paths (also known as a routing) and define 



Note that in the definition of b, the bottleneck number of (67, T), the maximum is taken over directed edges 
and so is to be distinguished from the related concept of the edge-forwarding index of (G,T) (see [5]). We 
distinguish directed and undirected edges by adorning the former with an arrow, and if e is an undirected 
edge connecting vertices x and y, we write e = {x, y}. Throughout this paper, the term path refers to a 
sequence of vertices where successive terms are connected by an edge. Repeated vertices are allowed, but 
no edge may appear more than once. (Some authors refer to such a path as a trail.) If ~et is an edge from 
x to y, we write 7 3 e, 7 3 ~£ if the vertex sequence defining 7 contains x and y as consecutive terms, x 
preceding y in the latter case. 

A simple random walk on G begins at some vertex Xq and then proceeds by moving to a neighboring 
vertex chosen uniformly at random. This defines a Markov process {A^.} with state space X. transition 
probabilities 



1. Background and Notation 



n = \X\ , d = maxdeg(a;), 7* = max I7I , and b = max [{7 € T : 7 9 e }| . 




otherwise 




and stationary distribution 



deg(x) 
2\E\ 
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This Markov chain is irreducible and reversible, thus the operator K defined by 

[K<t>]{x) = Y,K{x,y)ct>{y) 
yex 

is a self-adjoint contraction on L 2 (tt) with real eigenvalues 

1 = A>> Pi >--->/3n-l >-l 
whose corresponding eigenfunctions are orthogonal with respect to the inner product 

It is of interest to estimate /3* = max{/3i, as this quantity can be used to bound the r-step distance 

to stationarity. For example, when K is reversible with respect to n, letting K r x denote the distribution of 
X r given that Xq = x, we have the classical bound on the total variation distance to stationarity [2\ 

More generally, consideration of the Jordan normal form of the transition matrix shows that the 
exponential rate of convergence of any ergodic Markov chain is governed by the second largest eigenvalue 
(in magnitude). Because one can often ensure that /?j > - by adding holding probabilities, for 

example - much of the research has focused on bounding fii . 



In the above setting, Diaconis and Stroock [2] give the Poincare inequality 

A<1 2m 



\E\ 2 



and the Cheeger inequality 

/?1 - 1 aw 

These inequalities are corollaries of results that hold for all irreducible, reversible Markov chains and are 
based on geometric techniques (derived by their namesakes) for bounding the spectral gap of the Laplacian 
on a Riemannian manifold. Both ultimately rely on the variational characterization of the eigenvalues of the 
discrete Laplacian L = I — K: 



1-Pi= inf 



cj) nonconstant Var 7r (0) 



where £{d>. 6) is the Dirichlet form 



and 



£(<f>,<l>) = (<l>,I4) ir = \ E [<K*) - <Kv)]Mx)K(*>v) 

x,yeX 

Vzx„{4>) = {4>-E„{4>i<i>-EM)* = \ E M x ) ~ m?<x)<y) 

x,y£X 

is the variance of <f> with respect to 7r, = (4>, being the corresponding expectation. This character- 

ization follows easily from the Courant-Fischer theorem [5] and the properties of K. 

The use of canonical paths in this framework originated in the work of Jerrum and Sinclair on approxi- 
mating the permanent of a 0-1 matrix [6]. For the purposes of this exposition, we will only be considering 
the aforementioned inequalities for random walks on graphs, but it should be noted that they are both 



overestimates of the more general Poincare and Cheeger bounds. Also, observe that the variational charac- 
terization immediately gives lower bounds on f3i by evaluating the Rayleigh quotient at any nonconstant <f>. 
The interested reader is encouraged to consult [2] for the general bounds, their proofs, and the derivation of 
these particular cases for random walk on a graph. 

Though both inequalities are valid for any choice of T, their utility hinges upon a clever selection of 
canonical paths. In particular, one seeks to minimize 7*6 for Poincare and b for Cheeger. It was noted in [5] 
that the Poincare bound is often superior, regardless of the choice of T, but it was left as an open question 
whether this is always the case. A little algebra shows that this is equivalent to asking whether Ad 2 b > 7* \E\ 
for all choices of T. In addition to better understanding eigenvalue estimates for simple random walk on 
connected graphs, knowledge of the conditions under which the preceding inequality holds is of interest in its 
own right. For example, bounds relating standard graph theoretic quantities with measures of bottlenecking 
may be useful in applications involving network management or optimal distribution. 

Jason Fulman and Elizabeth Wilmer |3J have shown that Poincare beats Cheeger for simple random walk 
on trees, for which there is only one choice of canonical paths. They also show that for any vertex transitive 
graph, such as the Cayley graph of a group with a symmetric generating set, and any collection of paths T, 
one has the inequality bd 2 > D \E\ where D = diam(G). Thus Poincare beats Cheeger for random walk on 
vertex transitive graphs whenever 7* < AD. In particular, this implies that the Poincare bound is superior 
in these graphs when the paths are taken to be geodesies. 

For more background and examples, the reader is referred to and PJ. Other useful references include 
El El H] - By way of a counterexample, we answer the question posed by Diaconis and Stroock in the negative. 
Moreover, we extend the work of Fulman and Wilmer by providing more general criteria for the Poincare 
bound to prevail. 

2. A Counterexample 

Consider the case when G is a complete graph on n vertices. Then d = n — 1 and \E\ = (™). Now let 
7 be a Hamiltonian path with initial vertex xo and terminal vertex yo, and define a routing T by letting 
7 XlJ) be the unique (length 1) oriented geodesic from x to y for each (x,y) e X x X \ {(xo,y )} and letting 
7x ,3/ = 7- Then 7* = I7I = n — 1 and b — 2. The conjectured inequality is thus 

8(n - l) 2 = Ad 2 b > 7* \E\ = (n - 1) 

Because this fails when n > 17, there are infinitely many graphs where Cheeger beats Poincare for some 
choice of canonical paths, hence the Poincare bound is not uniformly superior to the Cheeger bound. Observe 
that since the left hand side is f2(n 2 ) and the right hand side is f2(n 3 ), the inequality cannot be salvaged by 
adjusting constants. Also, this particular choice of paths is kind of an overkill in that the argument actually 
shows that if one path in T has length greater than 16^9^ and the rest are geodesies, then we still get a 
counterexample. At the other extreme, observe that when n is odd, we can take the long path, 7, to be 
an Eulerian cycle with an edge deleted, and when n is even, we can take 7 to be an Eulerian cycle on a 
subgraph on n — 1 vertices with an edge deleted. These constructions show that counterexamples appear in 
all complete graphs on n > 7 vertices. 

Let us now examine the case of a complete graph on n > 3 vertices a little closer. The transition 
probabilities are given by 

K(x, y ) = ^—(l-6 x (y)), 
n — 1 
3 




so it is easy to see that G has 1 as a simple eigenvalue and its only other eigenvalue is — ^-j- with multiplicity 
n — 1. The preceding analysis shows that if we take T as above, then the Poincare bound is 

and the Cheeger bound is 

2 

A < i 



32(n- l) 2 ' 

Neither bound is remotely sharp, but Cheeger certainly comes out on top for n sufficiently large. If we take 
r to be the unique set of geodesies, in which case 7* = b = 1, then we obtain a Poincare bound of 

. 2 LEI 1 



c? 2 7*6 n — 1 

The corresponding Cheeger bound is 

\E\ 2 1 

1 - ^-ttU = 1- ^ 



2d 4 b 2 8 \n-l / 

Because 7* and b are small as they can be in this case, these are the best possible bounds. 

Thus even though we have an example where Cheeger beats Poincare (on a vertex transitive graph), it 
involves a choice of paths that yields terrible bounds while the best possible choice of paths gives an ideal 
Poincare bound that is much better than the optimal Cheeger bound. Perhaps then the relevant question 
is whether the best possible Poincare bound is always better than the best possible Cheeger bound. If the 
Poincare v. Cheeger conjecture is stated in terms of optimal bounds, then one can stipulate that the paths 
do not have repeated vertices as allowing them to contain cycles can only increase the bottleneck number and 
the longest path length. However, the counterexample shows that this restriction alone does not guarantee 
that the Poincare bound is uniformly better than the Cheeger bound. Moreover, for an arbitrary graph, 
there is no reason to suspect that the optimal Poincare bound is realized by a choice of paths which gives the 
optimal Cheeger bound since a choice of paths which minimizes 7*6 need not minimize b (and vice versa), 
so this question is more involved than asking when Ad 2 b > 7* \E\. Indeed, it seems that a resolution of this 
issue would require some characterization of the collections of canonical paths that yield the best bounds, 
which is probably both the most important and the most difficult problem associated with these path-based 
eigenvalue inequalities. For example, if tt(G, T) denotes the edge-forwarding index of (G, T), which is defined 
just like the bottleneck number but in terms of undirected edges, then it is known |3] that the problem of 
determining whether a given integer upper-bounds the minimum of n(G, T) over all minimal, symmetric, 
and consistent routings T is NP-complete when Diam(G) > 3. 

At this point, we observe that the construction of the counterexample involved a choice of canonical paths 
with one exceptionally long path and many shorter paths. This is no coincidence. For any choice of paths, T, 
there are M = X) 7 er ItI oriented edges counting multiplicity. Since there are a total of 2 \E\ oriented edges 
in the graph, the pigeonhole principle implies b > jfgfr- In terms of the average path length, 7 = ^ X) 7 er M' 
we get the inequality b > ^mt- (Note that we are including the n empty paths in this average.) 

This shows that a sufficient condition for Poincare to beat Cheeger is that 2d 2 n 2 ^ > 7* \E^ . Because 
2 I JE7 1 = J2 x ex deg(x) < dn, it follows that Poincare beats Cheeger whenever 87 > 7*. This idea was used 
implicitly in [5], but the author feels that it is significant enough to be stated directly as 

Theorem 1. For any simple connected graph G, if T is a set of canonical paths that satisfies 87 > 7*, then 
Ad 2 b > 7* \E\, hence the Poincare bound is superior to the Cheeger bound for this choice of paths. 



Thus in order for the Cheeger inequality to prevail, the longest path length must exceed the average path 
length considerably. When attention is restricted to geodesic paths, this shows that Poincare beats Cheeger 
whenever the mean distance is at least one eighth of the diameter. In addition to providing a nice general 
criterion, this observation shows that Poincare beats Cheeger along geodesies for a larger class of graphs 
than just those which are vertex transitive. 

Recalling the inequality b > where M = X) 7 er M' one sees that M is minimized when T is taken to 
be a collection of geodesies. This choice also minimizes 7*, though there could be other choices of canonical 
paths not consisting entirely of geodesies for which 7* = Diam(G). This suggests that geodesic paths are 
often a good starting point for finding optimal bounds. Of course ^gj is only a lower bound for b and there 
are often many choices for T such that all paths are geodesies, so this is in no way a sufficient criterion for 
optimization. Indeed, one can sometimes reduce b without increasing 7* by taking a slight detour in traveling 
between certain vertices. A little thought will show that this is the case in the following graph. 




Still, for the reasons indicated above and the fact that geodesies are often among the more obvious choices 
for canonical paths, it would be interesting to know more about when Poincare beats Cheeger along geodesies 
and when geodesic routings yield optimal bounds (as in the case of complete graphs) . All of the examples 
in (2] and [3] used geodesic paths, but the above graph shows that this is not always the best choice. 



In this section, we show that if T is taken to be the set of canonical paths along any spanning tree of any 
simple connected graph, then the Poincare bound is strictly better than the Cheeger bound. Specifically, 
we have d 2 b > 7* \E\ for such V. Moreover, the inequality is strict whenever the graph has more than two 
vertices, or equivalcntly, when d > 2. Since every connected graph has at least one spanning tree, this will 
show that there is always a choice of paths for which Poincare beats Cheeger. 

To begin, let G = G(X, E) be a simple connected graph on \X\ = n vertices and let T be any spanning 
tree for G. Then we can define T = T(T) to be the (unique) set of paths along T. We will show that Poincare 
beats Cheeger for such routings by first dealing with the case where 7* is large and then using the results of 
[3] to handle the remaining case. We note at the outset that as there is only one connected graph with d = 1 
and it is easily verified that d 2 b = 1 = 7* in this case, we can assume throughout that d > 2. Now let 
us say that that a routing is subordinate with respect to a path 7^ = vq, v\, v m if for all < i < j < m 
with Vi 7^ Vj, J Vi ,Vi = v %i v i+ii Vj. With this terminology, we can dispense with the large 7* case using the 
following lemma. 

Lemma 1. IfT is subordinate with respect to a longest path 7 and 7* = |7| > — 2, then d?b > 7* \E\. 
Proof. 



3. Spanning Trees 
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because if 7 = Vq, V\, u 7 „, then for all < i < j < 7*, 7^,^ = «i, Wj+i, Vj, so there are initial 
vertices and (7* + 1) — [ 7 * 2 f 1 j terminal vertices of paths through a central edge of 7. Consequently, if 
7* > - 2 (thus d 2 7, + 2d 2 > 4 then 

j2, ^ , 2 7*(7*+2) d 2 7. + 2d 2 

d 6 > d = 7* > 7* \E\ . 

□ 

Since a spanning tree routing is subordinate with respect to each of its paths, Lemma 1 shows that if 
r = r(T) is the set of canonical paths along a spanning tree T of G, then the Poincare bound is strictly 
better than the Cheeger bound whenever 7* > — 2. 

We now note that for any connected graph G on n vertices, we must have \E\ > n — 1. When \E\ = n— 1, 
G is a tree and there is only one choice of canonical paths. In this case, every path is subordinate with 
respect to the longest path, which has length 7* = \E\ = n — 1, so the first line in the proof of Lemma 1 
shows that 

- 4 4 1 1 

Since d > 2 whenever n > 2, we see that if G is a graph on n > 2 vertices with \E\ = n— 1, then d 2 b > 7* \E\. 
Consequently, we can assume henceforth that \E\ > n. 

For the remaining case, we appeal to Theorem 4 in [3 , the proof of which is included for completeness. 

Lemma 2. Let T be a tree with n vertices having maximal degree dx > 2 and bottleneck number bx- Then 

(n - l) 2 



b T > 



4 



Proof. Since T is a tree with n vertices, T has n — 1 edges, and for any oriented edge e, the number of pairs 
of vertices (x, y) with e € 7^ is of the form k(n — k), 1 < /c < S where deletion of edge e cuts T into two 
components of size A: and n — k, respectively. Because k(n — k) is increasing in k for 1 < k < 5, it is enough 
to show there exists an edge e* whose removal divides T into two components, the smaller of which has size 
at least ^— -. Given such an edge, we have 

> 'n-n / _ n-l\ > /(fr-l)(n-in > (n - I) 2 



dx J \ dx ) V rf 2 " / 

Thus we need only to demonstrate the existence of e* . To this end, note that if there is an edge which 
cuts T into two pieces of equal size, then we may take this edge to be e*. Otherwise, the deletion of any edge 
divides T into two pieces of size k and n — k, respectively, where k < n — k. In this case, we can orient the 
edges by directing each edge from its endpoint in the component of size k to its endpoint in the component 
of size n — k. Because there are n vertices and n — 1 edges, there must exist a vertex v* with indegree 0. 
Since deg(t>*) < dx, there must be some edge leaving v* whose deletion cuts T into two components, the 
smaller of which has size at least ^^r- Calling this edge e* establishes the result. □ 

Now by Lemma 1, we can assume that 7* < -^p- — 2. Also, since d > 2 implies there are at least 3 vertices 
in G (and thus in T), the maximal degree of the vertices in T is dx > 2. Because d > dx, Lemma 2 shows 
that 

b^b T >^X>J n - l)2 



4 - d? 



Therefore, since 2 |J5| = J2xex deg(x), thus 2 \E\ < nd, and we may assume also that \E\ > n, we have 



^ d 2 ; 1 1 d 2 1 1 - d 2 

We record the above result as 

Theorem 2. For any simple connected graph G, ifT is taken to be the set of paths along any spanning tree 
of G, then d 2 b > 7* \E\ for this choice ofT. If G has at least 3 vertices, then the inequality is strict. 

Since every connected graph has at least one spanning tree, we have the immediate corollary 

Corollary 1. For every simple connected graph, there is a choice of canonical paths such that the Poincare 
bound is strictly better than the Cheeger bound. 

It is worth pointing out that since trees are acyclic, these results also do not depend on whether repeated 
vertices are allowed. Of course, the above construction of T is certainly not optimal as it inflates both the 
bottleneck number and the longest path length by precluding certain combinations of paths. Moreover, 
different choices of spanning trees can yield different bounds. For instance, in the complete graph on 
n = 2m + 1 > 3 vertices, two possible choices for T are a Hamiltonian path or the geodesies emanating 
from some vertex xq. In the first case, we get the Poincare bound 

An 

Pl ~ (n-l) 3 (n+l) 

and the Cheeger bound 

2n 2 

ft < 1- 



(n- l) 4 (n + l) 2 ' 
while in the second case, the Poincare bound is 

A<1 



2(n- l) 2 

and the Cheeger bound is 

Pi < 1 



n 



8(n-l) 4 ' 

4. Concluding Remarks 

We have established that Poincare is not uniformly superior to Cheeger for all simple connected graphs and 
Cheeger is not uniformly superior to Poincare for any simple connected graph. Strictly speaking, this resolves 
the question put forth by Diaconis and Stroock, but it would be nice to know whether the best possible 
Poincare bounds are always better than the best possible Cheeger bounds. A more complete characterization 
of the choice of canonical paths for which Poincare beats Cheeger would likely prove helpful in settling this 
question. Lemma 1 and Theorems 1 and 2 offer partial results in this direction and subsume all previous 
findings, but the matter is still far from being resolved. Perhaps examining the case of geodesic routings 
more closely would shed some light on these issues. Also, since every connected graph can be obtained by 
adding edges to a spanning tree or deleting edges from a complete graph, it may be possible to obtain related 
results using surgery methods in conjunction with the above analyses of these extreme cases. Lastly, and 
probably of greatest practical importance, one would like to know more about how to find routings which 
yield the best bounds. 
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